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Abstract. In this survey, we discuss some basic problems concerning random 
matrices with discrete distributions. Several new results, tools and conjectures 
will be presented. 



1. Introduction 

Random matrices is an important area of mathematics, with strong connections 
to many other areas (mathematical physics, combinatorics, theoretical computer 
science, to mention a few). 

There are two types of random matrices: continuous and discrete. The continuous 
models have an established theory (see [38], for instance). On the other hand, the 
discrete models are still not very well understood. In this survey, we discuss a few 
basic problems concerning these models. The topics to be discussed are: 

• The limiting distribution of the spectrum (Section 3). 

• The spectral norm and the second largest eigenvalue (Sections 4, 5). 

• Determinant (Section 6). 

• Rank and Singular probability (Sections 7, 8). 

• The condition number (Section 9). 

• Tools from additive combinatorics (Sections 10, 11, 12). 

Notations. We denote by M„ the n hy n random matrix whose entries are i.i.d 
Bernoulli random variables (taking values 1 and —1 with probability 1/2). This 
matrix is not symmetric. Symmetric matrices often come from graphs. We denote 
by Q{n,p) the adjacency matrix of the Erdos-Renyi random graph G{n,p). Thus 
Q(n, p) is a random symmetric matrix whose upper diagonal entries are i.i.d random 
variables taking value 1 with probability p and with probability q = 1—p. Another 
popular model for random graphs is that of random regular graphs. A random 
regular graph Gn,d is obtained by sampling uniformly over the set of all simple 
d-regular graphs on the vertex set {1, . . . , n}. The adjacency matrix of this graph 
is denoted by Qn,d- 
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In the whole paper, wc assume that n is large. The asymptotic notation is used 
under the assumption that n ^ oo. We write A <^ B ii A = o{B). c denotes a 
universal constant. All logarithms have natural base, if not specified otherwise. 



2. The universality principle 

Intuitively, one would expect a universal behavior among random models of the 
same object. For random matrices in particular, one would expect the distributions 
of specific eigenvalues be the same (after a proper normalization), regardless the 
model. Thus, given a theorem for continuous models, it is often simple to come 
up with a reasonable conjecture for discrete ones. For instance, there are fairly 
accurate tail estimates for the smallest singular value of a random matrix whose 
entries are i.i.d Gaussians (see for example Theorem 9.2 in Section 9). It would 
be natural to try to prove similar estimates for a random matrix whose entries 
are i.i.d Bernoulli. However, this kind of task is usually challenging, as the tools 
developed for continuous models are typically not applicable in a discrete setting. 
In the last few sections (Sections 10, 11, 12) of this survey we will present new 
tools developed recently in order to treat the discrete models. These tools, among 
others, reveal an intriguing connection between the theory of random matrices and 
additive combinatorics. 

For random graphs, there is a specific conjecture which establishes the universality 
between the two models G{n,p) and Gn,d (Erdos-Renyi graphs and random regular 
graphs). 

Conjecture 2.1. (Sandwich Conjecture) [28] For d ^ logn, there is a joint dis- 
tribution (or coupling) on random graphs H,Gn,d,G such that 



• H has the same distribution as G{n,pi) where pi = ^(1 — ^^^"""^ " ) and G 
has the same distribution as G{n,p2) where p2 = ^(1 + £i/^i^). 

• P(FcGd)-l-o(l). 

• P(G„,d c G) = 1 - 0(1). 

The conjecture asserts that a random regular graph can be approximated from both 
below and above by Erdos-Renyi graphs of approximately the same densities. The 
conjecture has been proved for d < n^/^~°(^) [28]. 

Theorem 2.2. The sandwich conjecture holds for logn <C <C n^/^/ log^ n. 

The main difficulty when dealing with the random regular graph Gn,d is that the 
(upper diagonal) entries of its adjacency matrix are not independent variables. But 
using Theorem 2.2, one can often deduce information about the spectrum of Gn,d 
using information about the spectrum of G{n,p). 
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3. Limiting distribution 

One of the cornerstones of the theory of random matrices is Wigner's semi-circle law, 
which estabUshed the limiting distribution of a certain class of random symmetric 
matrices [58] . We present here a more general version, due to Arnold [5] . 

Let ttij , I < i < j < n, be i.i.d random variables with common variance one and 
distribution function F{x) such that \x\''dF < oo for all k = 1,2,.... Let 
An be the random symmetric matrix of size n whose upper diagonal entries are 
^ij = ay/2-v/n. Let Ai > • • • > A„ be the (real) eigenvalues of An- Define 

Wn{x) := -\{i\Xi < x}\. 
n 

Let W{x) denote the semi-circle density function 

W{x) := -Vl-x^ 

TT 

for < 1 and W{x) := otherwise. 
Theorem 3.1. (Semi-circle law) With probability one, 

lim Wn{x) = W{x). 

n— ^oo 

In order to prove the semi-circle law, Wigncr introduced the so-called trace method, 
the heart of which is the calculation of the expectation of Trace{A'^) for k = 
1,2,.... This method is useful for many other problems (see Section 4 for example). 

Let us now turn to the special matrix Q{n,p). The entries of Q{n,p) have variance 
(T^ = p{l —p). Dividing each entry of Q(n,p) by a, we obtain a matrix Q'(n,p) 
whose entries have common variance one. However, one cannot apply Theorem 3.1 
directly as the entries of Q'{n,p) do not have bounded moments when p tends to 
zero with n. On the other hand, by applying Wigner's trace method, one can prove 
the following theorem 

Theorem 3.2. [21, 57] There is a constant c such that the following holds. Let 
Ai > • • • > A„ be the eigenvalues of Q{n,p) where p > n and define 

Ri{x) := ^\{i\Xi < 2x^np{l-p)}\. 
Then with probability one, 

lim R\{x) = W{x). 
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A corollary of a general theorem by Guionet and Zeitouni [24] shows that R\{x) and 
many other quantities concerning the spectrum of Q{n,p) are highly concentrated. 

Next we discuss the situation with the random regular graph Gn,d- Define 



and 

for < 1 and W{d,x) := otherwise. A theorem of McKay [39] on the spectrum 
of regular graphs (not necessarily random) implies 

Theorem 3.3. (Distribution of the eigenvalues in random regular graphs with fixed 
degree) For any fixed d the following holds with probability one 

lim Rl{x) = W{d,x). 

n— >oo 

Observe that the limiting distribution W(d, x) in this theorem is not semi-circular 
because of the extra term d'i_^d-i)x'^ ' other hand, it becomes arbitrarily 

close to the semi-circle distribution if d is sufficiently large. Thus it is reasonable 
to conjecture that if d tends to infinity with n, Qn,d follows the semi-circle law. 
However, McKay's proof used Wigner's trace method and relied on the crucial fact 
that the graph has few small cycles. Theorem 3.3 still holds for d = n"^^-*. But 
for d = n'^ with any constant e > the graph has too many small cycles and it 
seems very hard to apply this method. On the other hand, using the sandwiching 
theorem (Theorem 2.2), Vu and Wu [57] proved that if logn <C d <C n^/'^/log^ n 
then with probability one 

lim Rl{x) = W{x). 

n— ^cx) 

If the sandwich conjecture holds for all d ^ logn, then this statement can be 
extended for all d ^ logn. Recently, Zeitouni (private communication) suggested 
to the author another approach that also seems to work for a wide range of d. The 
details will appear in [57]. For results concerning more general models of random 
graphs, see [11, 57]. 

To conclude this section, let us briefly discuss the case when An is not symmetric. 
In this case, the eigenvalues are complex numbers. Let Oy, I < i,j < n, be 
i.i.d complex random variables with mean zero and variance one. Let An be the 
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random matrix with entries — aij / sjn and let Ai , . . . , A„ be the eigenvalues of 
An. Consider the two-dimensional empirical distribution 



tJ'n{x,y) := -\{i\Re{Xi) < x,Im{Xi) < y}\. 
n 

It was proved by Girko [22] and Bai [6] that under some weak conditions, fin{x,y) 
tends to the uniform distribution over the unit disc. 

Theorem 3.4. (Circular law) Assume that the have finite sixth moment and 
the joint distribution of the real and imaginary part has bounded density. Then with 
probability one iin{x,y) tends to the uniform distribution over the unit disc in R^. 

4. The spectral norm 
The spectral norm of an n by n matrix A is defined as 



ll^ll = sup \Av\. 

v(iR",\\v\\ = \ 

If A is symmetric, then |m| is the largest eigenvalue of A (in absolute value). 

Wc consider the following general model of random symmetric matrices. Let 
^ < i < j < n, he independent (but not necessarily identical) random variables 
with the following properties 

• |6j I < if for alll < i < j < n. 

• ^{iij) = 0, for all 1 < i < j < n. 

• Var(^,j) = cr^, for all 1 < z < j < n. 

For a moment, let us assume that a and K are positive constants. 

Define = and consider the symmetric random matrix An = Notice 
that for any matrix A, 

\\A\\= lim Tr(A'=)V'=. 

k — >oo 

This suggests that the trace method would be an effective tool for bounding ||^n||- 
Indeed, all upper bounds mentioned below are based on this method. Fiiredi and 
Komlos [21] proved that a.s. 

\\An\\ < 2cr\/n + cn^/^lnn. 
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The error term cn^^^lnn was recently improved to cn^^^lnn by Vu [56]. From 
below, Alon, Krivelevich and Vu [4] showed that a.s. 



2ay/n - clnn < \\A., 




for some constant c. Putting these bounds together, we have 

Theorem 4.1. For a random matrix A as above there is a positive constant c = 
c{S, K) such that 



holds almost surely. 

In many situations a and K may depend on n. A typical example is when A is the 
"normalized" adjacency matrix of G{n,p) (1 and are replaced by 1 — p and —p, 
respectively; this forces all entries to have mean zero) where p is decreasing with 
n {p = n~', say). In this case, by following the proof of the upper bound in the 
previous theorem, one can obtain 

Theorem 4.2. [56] There are constants c and c' such that the following holds. 

Let ^ij, 1 < i < j < n be independent random variables, each of which has mean 
and variance at most and is bounded in absolute value by K, where a > 
c'n~^/'^K\T? n. Then almost surely 



One can also obtain a somewhat weaker bound by following the proof from [21]. 

If one assumes more about the distributions of the entries ^jj , one can obtain a 
sharper bound. Soshnikov and Sinai [46] proved the following 

Theorem 4.3. (Spectral bound for random matrices with symmetric entries) Let 
An be a random symmetric matrix whose upper diagonal entries < i < j < n 
are independent random variables satisfying 

• ha,ve symmetric distribution. 



2ay/n-clnn< \\A\\ < 2aVn + cn^^'^lnn. 



\\An\\ < 2aV^ + c{Kay^^n^^^lnn. 




• For all m>l, E(^?/") = 0(m) 



Then a.s. = 2y/fi + 0{n-^^^). 



In certain algorithmic applications, it is useful to have a tail distribution for the 
spectral norm (see [1, 32], for instance). Using Talagrand's inequality, one can show 
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Theorem 4.4. [32, 4] ( Concentration of the spectral norm) There is a positive 
constant c = c{K) such that for any t > 



P( ||A„||-E(||A„||) > ci) < 4e-*'/=^2. 



Similar results hold for larger classes of random matrices and also for other eigen- 
values (see [4, 40]). 

In the case of Q{n,p), Hp is sufficiently large, then all rows have a.s. roughly np 
ones and the norm of Q{n,p) is (a.s.) (1 + o{l))np. But if p is relatively small, this 
is no longer true. Krivelevich and Sudakov [30] proved that ||(5(n,p)|| is almost 
surely 



(1 + o(l)) max{np, \/D} 

where D denotes the maximum degree of the (random) graph. See [47, 26] for more 
results of this type. 



5. The second eigenvalue of random regular graphs 

Let G be a graph on n points and A its adjacency matrix. Let Ai > A2 > • • • > A„ 
be the eigenvalues of A. If G is d-regular, then Ai = d. In this case, a critical 
parameter of the graph is 

A(G) :=max{|A2|,|A„|}. 

In the literature, A(G) is frequently called the second eigenvalue of G. (This name 
is inaccurate but somewhat convenient.) The good way to think of A(G) is 



A((j) = max {v^ ■ Av\, 

llull = l,i;-l=0 

where 1 is the all ones vector. One can also think of \{G) as the spectral norm of 
the "normalized" adjacency matrix of G, where 1 and are replaced by {n — d)/n 
and —d/n, respectively. 

One can derive many interesting properties of the graph G from the value of A(G). 
The general phenomenon here is that if A(G) is significantly less than d, then the 
edges of G distribute like those of a random graph with edge density d/n [3, 49, 10]. 
One can use this information to derive various properties of the graphs (see [31] for 
many results of this kind). The whole concept can be generalized for non-regular 
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graphs. In this case, one needs to consider the Laplacian rather than the adjacency 
matrix (see, for example, [9]). 

Estimating A(G) for a random regular graph is a well known problem in the discrete 
math/theoretical computer science community. A consequence of the well known 
Alon-Boppana bound [2] asserts that if d is fixed and n tends to infinity, a.s. 

A2(G„,d) > 2Vd^-o{l). 
Since X{G) > \X2{G)\ it follows that a.s. 

A(G„,d) > 2Vd^-o{l). 
Alon [2] conjectured that for any fixed d, a.s. 

A2(G„,d) = 2\/rf^+o(l). 

Friedman [18] and Kahn and Szemeredi [20] showed that if d is fixed and n tends 
to infinity, then a.s. X{Gn,d) = 0{\fd). Recently, Friedman, in a highly technical 
paper [19], proved Alon's conjecture. In fact, he proved the stronger statement 
that a.s. X{Gn,d) = 'i.^/d — 1 + o(l). This, together with the lower bound above, 
determines the asymptotic of A(G„,d). 

Theorem 5.1. [19] (Second eigenvalue of random regular graphs with fixed degree) 
For any fixed d and n tending to infinity, a.s. 

A(G„,d) = (2 + o(l))x/d^. 



A d-regular graph G is Ramanujan if A(G) < 2\/ d — \. Explicit constructions of 
Ramanujan graphs are highly non-trivial and usually come from deep results in 
number theory (sec [35] or [36], for example). On the other hand, the following 
conjecture has been circulated in the last few years (mentioned to the author by 
Sarnak) 

Conjecture 5.2. For d fixed and n tends to infinity, Gn,d is Ramanujan with 
positive constant probability. 

So far, we discussed the case when d is a constant. What happens if d also tends 
to infinity with n? It is not clear (at least to the author) that Friedman's proof of 
Alon's conjecture in [19] can be extended to this case. On the other hand, it is not 
hard to show that X{G{n,p)), where G{n,p) is the Erdos-Reyi random graph, is 
{2+o{l))\/np{l — p) for sufficiently largep (e.g., p > n~^+'^ for any fixed < e < 1). 
Motivated by the universality principle, we make the following conjecture 
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Conjecture 5.3. Assume that d < n/2 and both d and n tend to infinity. Then 
a.s 



X{Gn,d) = (2 + o(l))Vrf(l-rf/n). 

Nilli [41] showed that for any rf-regular graph G having two edges with distance at 

least 2k + 2 between them A2(G) > 2^^~T - 2y/d~T/{k + 1). Ud = n"'^^^ then 
Gn,d has diameter w(l) with probability 1 — o(l). Thus in this case 



A(G„,d)>A2(G„,d)>(2 + o(l))Vd 

with probability a.s. This proves the lower bound in Conjecture 5.3. For a general 
d, it is easy to show (by computing the trace of the square of the adjacency matrix) 
that any rf-regular graph G on n vertices satisfies 



A(G) > ^/d{n-d)/{n-l) « ^/d{l - d/n). 

(We would like to thank N. Alon for pointing out this bound.) 

Let us now turn to the upper bound. For d = o(n^/^), one can follow the Kahn- 

Szcmcrcdi approach to show that \{Gn,d) = 0{Vd) a.s. For larger d, there is a 
weaker bound o{d) [33, Theorem 2.8] proved by the trace method. The following 
two approaches look promising: 

• (Suggested by Krivelevich) Combine the sharp concentration resiilt in the 
previous section with the probability that a random graph is regular. Using 
this, one can show for example that A(G„^d) = 0{\/dlogn) for d close to n 
{d = n/2, for instance). 

• The Sandwich Theorem (Theorem 2.2) implies 

A(C?„,d) = XiG{n, d/n)) + 0{^Jd\ogn). 

For most values of d, X{G{n,d/n)) = 0{Vd). Thus, if the Sandwich 
conjcctinc holds, it would imply a upper bound of 0{\/dlog n) for most 
values of d. 

The author feels confident that one can prove that X{Gn,d) = 0{^/dlogn) for all d 

using these approaches. However, removing the log term seems non-trivial. In fact, 
even the following special and weakened case already looks challenging 

Problem. Prove that A(G„_„/2) = 0{^/n) almost surely. 
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6. Determinant 

The problem of determining the determinant of M„ has been considered by various 
researchers for at least 40 years. It was proved by Komlos in 1967 [29] that almost 
surely detM„ is not zero. In fact, it is easy to see that detM„ is divisible by 2"~^, 
thus it follows that a.s. |detM„| > 2"~^. From above, Hadamard's inequality 
implies that | detM„| < n"/^ (notice that all row vectors of M„ have length ^Jn. 
It was often conjectured that with probability close to 1, | detM„| is close to this 
upper bound. 

Conjecture 6.1. Almost surely \ detM„| = n(^/^~°(^))". 

This conjecture is supported by the following observation of Turan, whose proof is 
a simple application of the linearity of expectation. 

Fact 6.2. 

E((detM„)2) =n\. 

It follows immediately by Markov's bound that for any function ui{n) tending to 
infinity with n, almost surely 

I detM„| < a;(n)\/n!. 

Tao and Vu [50] established the matching lower bound, which confirms Conjecture 
6.1. 

Theorem 6.3. Almost surely 

|detM„| > Vnl exp(- 29 v'n log n) . 

We are going to sketch the proof very briefly as it contains a useful lemma. For a 
more detailed proof, we refer to [50]. 

Proof We view | det M„ | as the volume of the parallelepiped spanned by n random 
{— 1, 1} vectors. This volume is the product of the distances from the (d+l)st vector 
to the subspace spanned by the first d vectors, where d runs from to n — 1. We 
are able to obtain very tight control of this distance (as a random variable), thanks 
to the following lemma. 

Lemma 6.4. Let W be a fixed subspace of dimension 1 < < n — 4 and X a 
random ±1 vector. Then 

'E{dist{X,Wf)=n-d. (1) 

Furthermore, for any t> Q 



P(|dist(X, W) - >t+l)<A exp(-iVl6). (2) 
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Observe that in this lemma, we do not need to assume that W is spanned by 
random vectors. The lemma, however, is not applicable when d is very close to 
n as it does not imply that the distance is positive almost surely. In this case, 
we do need to use the assumption that W is random. This assumption allows 
us to derive information about the normal vector of W, which, combined with 
Erdos-Littlewood-Offord bound (see Theorem 10.1), provides control of the last 
few distances. ■ 

Remark 6.5. After having written [50], Tao and the author discovered that Girko 
(Section 6 of [23]) claimed a very general theorem which implies Theorem 6.3. We 
are not able to understand his proof and have not found anyone who does. 

Theorem 6.3 can be extended for much more general models of random matrices. 
Let ^ij, 1 < i,j < n, be independent (but not necessarily i.i.d.) r.v's with the 
following two properties: 

• Each has mean zero and variance one. 

• There is a constant K that < K with probability one. 

Theorem 6.6. Consider the random matrix with entries ^ij as above. Let e 
be an arbitrary positive constant. With probability 1 — o(l), 

|detM;| > V^.exp{-n^/^+'). 

In certain situations, the assumption that are bounded from above by a con- 
stant is too strong. We are going to consider the following less restricted model. Let 
^iji 1 < J ^ be i.i.d. random variables with mean zero and variance one. As- 
sume furthermore that their fourth moment is finite. Consider the random matrix 
M„ with as its entries. 

Theorem 6.7. [50] We have, with probability 1 — o(l), that 

IdetM^'l > n(i/2-o(i))n. 

An open problem concerning determinants is to extend Theorem 6.3 to symmetric 
matrices. Let Q„ denote the random symmetric matrix whose upper diagonal 
entries are i.i.d. Bernoulli random variables. 

Conjecture 6.8. Almost surely, |detO„| = n(i/2+o(i))n_ 

It was proved only very recently [13] that a.s | det Q„| is positive (which can be seen 
as the symmetric version of Komlos' theorem mentioned above). The main difficulty 
here is that the row vectors of Qn are no longer independent and so Lemma 6.4 is 
not applicable as there is a correlation between the subspace W and the vector X. 

Finally, let us briefly discuss the situation with the permanent. Notice that the 
estimate in Fact 6.2 is still valid for the permanent of M„. Thus, one would expect 
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that, like the determinant, the permanent of M„ is typically of order 71'^"'^/^"'"°*^"'^^^" 
(in absolute value). However, the following problem is still open. 

Problem. Show that the permanent of Mn is a.s. not zero. 

7. Rank and singular probability: non-symmetric models 

Let us consider the basic model Mn and let p„ be the probability that M„ is 
singular. Estimating p„ is well known problem in discrete probability. From below 
it is clear that p„ > (1/2 + o(l))", as a matrix is singular if it has two equal rows. 
A famous conjecture in the field asserts that this trivial lower bound is sharp. 

Conjecture 7.1. p„ = (l/2 + o(l))". 

There is a refined version of the above conjecture where the right hand side is more 
precise (see [27]). However, Conjecture 7.1, as formulated, is still open. 

It is already non-trivial to show that p„ = o(l). As mentioned in the previous 
section, this was done by Komlos almost fourty years ago [29]. The bound on 
Pn in his original proof tends very slowly to zero with n. Later, he found a new 
proof which showed p„ = 0(n~^/^). In 1995, a breakthrough by Kahn, Komlos and 
Szemeredi [27] yielded the first exponential bound p„ = 0(.999"). Their arguments 
were simplified by Tao and Vu in 2004 [50], resulting in a slightly better bound 
0(.958") and a somewhat simpler proof. Shortly afterwards, Tao and Vu [51] 
combined the approach from [27] with ideas from additive combinatorics to obtained 
the following more significant improvement 

Theorem 7.2. [51] p{n) < (3/4 + o(l))". 

The proof in [51] is highly technical and requires many tools from discrete Fourier 
analysis, additive combinatorics and the geometry of numbers. On the other hand, 
the proving scheme is flexible and can be adapted to other models, sometime yield- 
ing (surprisingly) sharp bounds. Let us present one such result. Instead of M„ we 
considc^r the following "lazy" model Af'"^'-'. The entries of Af^"^^ are i.i.d random 
variables which equal zero with probability one half and 1 and — 1 with probability 
one quarter. (If one thinks of the entries of Af„ as fair coin flips, then in the "lazy" 
model about half of the time we are lazy and simply write zero instead flipping a 
coin.) It is clear that for the lazy model the singular probability p^^'" is again at 
least (1/2 + 0(1))" (which is the probability that there is a zero row). We are able 
to show that this bound is actually sharp 

Theorem 7.3. [55] pjf^^' = (1/2 -|- o(l))". 

Let us conclude this section by two conjectures motivated by studies from random 
graphs. These questions concern the resilience of a structure, introduced in [28, 48] . 
Roughly speaking, the resilience of a structure S with respect to a property V 
measures how much we have to change S in order to destroy V. We would like to 
measure the resilience of Af„ with respect to the property of being non-sigular. 
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Given {—1, 1} matrix M, we denote by Res{M) the minimum number of entries 
we need to switch (from 1 to —1 and vice versa) in order to make M singular. If 
M is a sample of M„, it is easy to show that Res{M) is, a.s, at most (l/2 + o(l))n, 
as we can, a.s, change that many entries in the first row to make the first two rows 
equal. We conjecture that this is the best one can do. 

Conjecture 7.4. Almost surely Res{Mn) = (1/2 + o(l))n. 

A closely related question (motivated by the notion of local resilience from [48]) is 
the following. Call a {—1,1} (n by n) matrix M good if all matrices obtained by 
switching (from 1 to —1 and vice versa) the diagonal entries of M are non-singular 
(there are 2" such matrices). 

Conjecture 7.5. Almost surely Mn is good. 



8. Rank and singular probability: symmetric models 

Let us now consider symmetric matrices. The symmetric counterpart of M„ is Q„. 
In fact is is more convenient to consider Q{n, 1/2) instead of Qn, as the graph 
terminology is more convenient and leads to natural extensions. (It is easy to show 
that if Q{n, 1/2) is a.s. non-singular then Qn is and vice versa.) 

While the non-singularity of M„ has been known for forty years since [29], that 
of G(n, 1/2) was established only recently by Costello, Tao and Vu [13]. This 
confirmed a conjecture of B. Weiss (this conjecture was communicated to the author 
by G. Kalai and N. Linial). 

Theorem 8.1. Q{n, 1/2) is a.s non-singular. 

As pointed out earlier in Section 6, the main difficulty in going from the non- 
symmetric setting to the symmetric one is that the row vectors in a random sym- 
metric matrix are not independent. The key tool that helped us to overcome this 
difficulty was the so-called quadratic Littlewood-Offord inequality (Theorem 10.2), 
discussed in Section 10. 

It is natural to ask if Theorem 8.1 still holds for a smaller density p. The answer is 
negative after a certain threshold. Indeed, if p < (1 — e) log n/n for some positive 
constant e, then G{n,p) has a.s. isolated vertices which means that its adjacency 
matrix has all zero rows and so is singular. Costello and Vu proved that log n/n is 
the right threshold. 

Theorem 8.2. [12] For any constant e > 0, Q(n, (1 -|- e)logn/n) is a.s. non- 
singular. 

It remains an interesting problem to estimate the rank of Q(n,p) for p < log n/n. 
The answer here is not yet conclusive, but some partial results are known. For 
instance, it was shown in [12] that if p > (1 + e) logn/2n, then the rank of Q{n,p) 
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is a.s. equal n minus the number of isolated vertices in the graph. (The upper 
bound is trivial.) 

Let us conclude this section by stating two conjectures. The first is a variant of 
Conjecture 7.1. We denote by pf}'"^ the probabiUty that Q{n, 1/2) is singular. It is 
easy to show that this probability is at least (1/2 + o(l))". 

Conjecture 8.3. p^?'™ = (1/2 + o(l))". 

This conjecture is perhaps very hard. The current best upper bound on pf^'^ is 
^-i/4+o(i) seems already non-trivial to replace 1/4 by an arbitrary constant C. 

The second conjecture concerns random regular graphs. If d = 2, Gn,d is a union of 
disjoint cycles and it is easy to show that its adjacency matrix Qn,d is a.s. singular, 
as many of these cycles have length divisible by 4. We conjecture that this is the 
only case. 

Conjecture 8.4. For all d>3, Qn,d is a.s. non-singular. 



9. The condition number 

For a matrix M the condition number c{M) is defined as 

c(M) := ||(M)||.||(M-i)||. 
We adopt the convention that c{M) is infinite if M is not invertible. 

The condition number plays a crucial role in applied linear algebra. In particular, 
the complexity of any algorithm which requires solving a system of linear equations 
usually involves the condition number of a matrix [7] . Another area of mathematics 
where this parameter is important is the theory of probability in Banach spaces (see 
[43] and the references therein). 

The condition number of a random matrix is a well-studied object (see [14] and the 
references therein). In the case when the entries of M are i.i.d Gaussian random 
variables (with mean zero and variance one), Edelman [14] proved 

Theorem 9.1. Let Nn be anxn random matrix, whose entries are i.i.d Gaussian 
random variables (with mean zero and variance one). Then E(lnc(7V„)) = Inn + 
c + o{l), where c> is an explicit constant. 

In applications, it is usually useful to have a tail estimate. It was shown by Edelman 
and Sutton [15] that 

Theorem 9.2. Let Nn be an by n random matrix, whose entries are i.i.d Gaussian 
random variables (with mean zero and variance one). Then for any constant ^ > 0, 

P(c(Ar„) > n^+') = OA{n-^). 
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With the universahty principle, it is reasonable to conjecture that this estimate 
holds for the random Bernoulli matrix M„ as well (see [45] for an even more precise 
conjecture). However, this seems very hard to prove. On the other hand, the 
following was obtained recently by Tao and Vu [52] 

Theorem 9.3. For any positive constant A, there is a positive constant B such 
that 

P(c(M„) > n^) < n"^. 

It is well known that there is a constant C such that the norm of M„ is at most 
Cn}/"^ with exponential probability 1 — exp(— 0^(n)) (in fact, one can prove this 
using the results in Section 4). Thus, Theorem 9.3 reduces to the following lower 
tail estimate for the norm of M~^: 

Theorem 9.4. For any positive constant A, there is a positive constant B such 
that 

P(||M-i|| >n-^) <n-^. 

Shortly prior to Theorem 9.3, Rudelson [43] proved the following result. 

Theorem 9.5. There are positive constants ci,C2 such that the following holds. 
For any e > cin~^/^ 

P{\\M-'\\>C2en'/')<e. 

Both theorems can be generalized considerably (see [52, 53] and [43]). Theorem 
9.3 in particular still holds if we replace M„ by M + M„ where M is an arbitrary 
matrix with polynomially bounded norm. 

Theorem 9.6. [53] For any positive constants A and C, there is a positive constant 
B such that the following holds. For any n by n matrix M where \\M\\ < n^, 

P(c(M + M„) > n^) < n-^. 

The point here is that M itself can have very large condition number. (In fact 
if M is singular then its condition number is infinity.) Theorem 9.6 asserts that 
a Bernoulli perturbation of M has small condition number with high probability. 
The Gaussian version of Theorem 9.6 was proved by Spielman and Teng in [45]. 
For the connection of these theorems to numerical linear algebra and theoretical 
computer science, we refer to [45] and [53]. 

10. Littlewood-Offord and quadratic Littlewood-Offord 

Let V = {ui, . . . , Vn} be a set of n integers and ^i, . . . , be i.i.d random Bernoulli 
variables. Define S := Y^^=i ^i'^i ^^'^ Pv(a) := P{S = a) and pv ■= suP(jgzPv(a)- 
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Erdos, answering a question of Littlewood and Offord, proved the following theorem, 
which we are referring to as the Erdos-Littlewood-Offord inequality. 

Theorem 10.1. Let wi, . . . be non-zero numbers and be i.i.d Bernoulli ran- 
dom variables. Then 



Theorem 10.1 is a classical result in combinatorics and has many non-trivial exten- 
sions (see [54, Chapter 7] or [25] and the references therein). 

The random sum 5 := ^i'^i pl^ys a central role in the study of random 

Bernoulli matrices. In many problems (such as that of the determinant, rank or con- 
dition number) , the critical parameter is the distance from a random Bernoulli vec- 
tor to the hyperplane spanned by another n—1 random Bernoulli vectors (these are 

the row vectors of Mn). If (?'i, . . . , v,,,) is the (unit) normal vector of the hyperplane, 
then this distance is exactly the absolute value of the random sum S = Y^^^i^iVi- 

Bounding the above mentioned distance is one of the main difficulties when one 
goes from Gaussian matrices to Bernoulli matrices. If the entries of the matrix are 
Gaussian, then the distance in question is a simple object. Thanks to symmetry, 
the position of the hyperplane docs not really matter and so wc can condition on 
it. Furthermore, the distribution of the distance from a random Guassian vector 
to a fixed hyperplane is well understood. The situation in the Bernoulli case is 
very different. In this case, the random vectors arc chosen from the vertices of the 
n-dimensional {—1,1} cube. Very little is known about the hyperplanes spanned 
by n — 1 such vectors. Let us point out, however, that there are planes that contain 
a constant fraction of the vertices of the {—1,1} cube and in this case the distance 
in question is zero with constant probability. 

In order to treat random symmetric matrices (in particular Q(n, 1/2)), Costello, 
Tao and Vu [13] introduced the following quadratic version of Theorem 10.1 

Theorem 10.2. Let Cij,l < i,j < n be non-zero numbers. Consider the qua- 
dratic form F = X]i<j<j<„ Cij^i^j, where < i < n are i.i.d Bernoulli random 
variables. Then for any a 



The exponent —1/8 was improved to —1/4 (see [12]). It is conjectured [12] that the 
sharp exponent would be —1/2. The lower bound is given by the quadratic form 



Pv < 




F{F = a) = 0(n-i/8). 
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11. Random walks and Lazy Random Walks 

Let V = {vi, vi, . . . , Vn} be the set of n non-zero numbers and consider the random 
walk W on the real hne (starting at 0) which at step i goes to the left by Vi with 
probability one half and to the right by Vi with probability one half. The probability 
Pv(0) = P(X^"=i ^iVi = 0) is exactly the probability that W returns to the origin 
after n steps. 

Let n he a, constant between and 1 and consider the "lazy" random walk 
(starting at 0) which at step i stays with probability 1 — /i and goes to the left by 
Vi with probability and to the right by Vi with probability Let p^{0) be 
the probability that the lazy walk return to zero after n steps. 

Intuitively, one would expect that p^{0) is larger than pv(0) (especially when /x is 
small), as the lazy walk has a stronger tendency to stay near the starting point. 
Quantitatively, one can show (using Fourier analysis and the elementary fact that 
I cosx| < 3/4 + l/4cos2a;) that for any v 

The next question is: Can one improve this to 



Pv{0) < epy\0), (3) 

for any positive constant e ? The answer is negative. If we take v = {1, 1, . . . , 1}, 
then it is easy to show that 

Pv(0) = (c + o(1))pV'(0) 

where c is a positive constant (depends on 1/4). However, it is possible to classify 
all sets V where (3) fails (nndcr some slight assumptions) . This classification is the 
heart of the proof of Theorem 7.2. The precise statement is somewhat technical 
(we refer to [51] for details), but roughly it says that if (3) fails then v is contained 
in a generalized arithmetic progression with constant rank and small volume. 



12. Inverse Littlewood-Offord theorems 

A set 

P = {c + miai H h mdUdlMi < nii < M-} 

is called a generalized arithmetic progression (GAP) of rank d. It is convenient to 
think of P as the image of an integer box B := {(mi, . . . ,md)\Mi < mi < M/} in 
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Z under the linear map 

$ : (mi, . . . , nid) ^ c + miai H h m<jad- 

The numbers ai are the generators of P. For a set A of reals and a positive integer 
fc, we define the iterated sumset 

kA:= {al^ h afe|aj € A}. 

Let us take another look at Theorem 10.1. This theorem is sharp, as is shown 
by taking wi = U2 = ••• = «« = 1- However, the bound changes significantly if 
one forbid this special case. Erdos and Moser [17] showed that under the stronger 
assumption that the are non-zero and diff'erent, 



Pv = 0(n-^/^lnn). 

They conjectured that the logarithmic term is not necessary and this was confirmed 
by Sarkozy and Szemeredi [46] (see also [25]). Again, the bound is sharp (up to 
a constant factor), as can be seen by taking . . . to be a proper arithmetic 
progression such as 1, . . . ,n. 

In the above two examples, we observe that in order to make pv large, we have to 
impose a very strong additive structure on v (in one case we set the Vi 's to be the 
same, while in the other we set them to be elements of an arithmetic progression). 
A more general example is the following 

Example. Let P be a GAP of rank d and volume V. Let vi,...,Vn be (not nec- 
essarily different) elements of P. Then the random variable S = Y^^^i ^iVi takes 
values in the GAP nP which has volume nf^V . Prom the pigeonhole principle and 
the definition of pv, it follows that 

This example shows that if the elements of v belong to a GAP with small rank 
and small volume then pv is large. One may conjecture that the inverse also holds, 
namely, 

If Pw is large, then (most of) the elements of v belong to a GAP with small rank 
and small volume. 

Tao and Vu [52] have managed to quantify this statement. 

Theorem 12.1. [52] Let A, a > be arbitrary constants. There are constants 
d and B depending on A and a such that the following holds. Assume that v = 

{fi, ... ,Vn} is a m,ultiset of integers satisfying P(S' = 0) > . Then there is a 
GAP Q of rank at most d and volume at most which contains all hut at most 
n" elements ofv (counting multiplicity). 
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Notice that the small set of exceptional elements is not avoidable. For instance, 
one can add O(logn) completely arbitrary elements to v, and only decrease pv by 
a factor of n~*^^^^ at worst. 

Theorem 12.1 is one of the main ingredients in the proofs of Theorems 9.3 and 9.6. 
For many other theorems of this type, see [52]. 

Acknowledgement. The author would like to thank K. Costello and P. Wood for 
proofreading the manuscript. 
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